AITopics

2404.02466

Country:

Asia > Singapore (0.05)
North America > Canada > Ontario > Toronto (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(13 more...)

Genre: Research Report > New Finding (1.00)

Industry: Banking & Finance > Trading (0.70)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Salinas, Abel, Morstatter, Fred

The Butterfly Effect of Altering Prompts: How Small Changes and Jailbreaks Affect Large Language Model Performance

arXiv.org Artificial Intelligence Jan-9-2024

Large Language Models (LLMs) are regularly being used to label data across many domains and for myriad tasks. By simply asking the LLM for an answer, or "prompting," practitioners are able to use LLMs to quickly get a response for an arbitrary task. This prompting is done through a series of decisions by the practitioner, from simple wording of the prompt, to requesting the output in a certain data format, to jailbreaking in the case of prompts that address more sensitive topics. In this work, we ask: do variations in the way a prompt is constructed change the ultimate decision of the LLM? We answer this using a series of prompt variations across a variety of text classification tasks. We find that even the smallest of perturbations, such as adding a space at the end of a prompt, can cause the LLM to change its answer. Further, we find that requesting responses in XML and commonly used jailbreaks can have cataclysmic effects on the data labeled by LLMs.
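The kinds of perturbation the abstract names can be illustrated with a minimal sketch. The helper name `make_variations`, the variant labels, the sample prompt, and the wording of the XML instruction are all assumptions made for illustration; they are not the authors' prompts or code.

```python
def make_variations(prompt: str) -> dict[str, str]:
    """Return a few small perturbations of a base prompt (hypothetical helper)."""
    return {
        "original": prompt,
        # Smallest perturbation named in the abstract: a space at the end.
        "trailing_space": prompt + " ",
        # Output-format request; the exact wording here is an assumption.
        "xml_format": prompt + "\nReturn your answer in XML.",
    }


base = "Is the sentiment of this review positive or negative? 'Great phone.'"
variations = make_variations(base)
for name, text in variations.items():
    print(f"{name}: {text!r}")
```

Each variant would then be sent to the model and the resulting labels compared against those from the original prompt.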

apple, sentiment, variation, (17 more...)

2401.03729

Country:

North America > United States > Oregon (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
Asia > Indonesia > Bali (0.04)
Asia > China (0.04)

Genre: Research Report (0.83)

Industry:

Education (0.46)
Media (0.30)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Zhang, Bohui, Reklos, Ioannis, Jain, Nitisha, Peñuela, Albert Meroño, Simperl, Elena

Using Large Language Models for Knowledge Engineering (LLMKE): A Case Study on Wikidata

arXiv.org Artificial Intelligence Sep-15-2023

In this work, we explore the use of Large Language Models (LLMs) for knowledge engineering tasks in the context of the ISWC 2023 LM-KBC Challenge. For this task, given subject and relation pairs sourced from Wikidata, we utilize pre-trained LLMs to produce the relevant objects in string format and link them to their respective Wikidata QIDs. We developed a pipeline using LLMs for Knowledge Engineering (LLMKE), combining knowledge probing and Wikidata entity mapping. The method achieved a macro-averaged F1-score of 0.701 across the properties, with the scores varying from 1.00 to 0.328. These results demonstrate that the knowledge of LLMs varies significantly depending on the domain and that further experimentation is required to determine the circumstances under which LLMs can be used for automatic Knowledge Base (e.g., Wikidata) completion and correction. The investigation of the results also suggests the promising contribution of LLMs in collaborative knowledge engineering. LLMKE won Track 2 of the challenge. The implementation is available at https://github.com/bohuizhang/LLMKE.
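For readers unfamiliar with the metric, a macro-averaged F1-score gives every property equal weight regardless of how many examples it has. The sketch below shows the general calculation only; the property names and the precision/recall values are made up, and the challenge's own scorer may compute per-property scores differently.

```python
def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def macro_f1(per_property: dict[str, tuple[float, float]]) -> float:
    """Average the per-property F1 scores, weighting every property equally."""
    scores = [f1(p, r) for p, r in per_property.values()]
    return sum(scores) / len(scores)


# Made-up (precision, recall) pairs for two hypothetical properties.
example = {"property_a": (1.0, 1.0), "property_b": (0.5, 0.5)}
print(macro_f1(example))
```

Because each property counts once, a single weak property pulls the average down as much as a strong one lifts it, which is consistent with the wide per-property spread the abstract reports.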

knowledge, python list, relation, (13 more...)

2309.08491

Country:

North America > Puerto Rico > Peñuelas > Peñuelas (0.04)
South America > Brazil (0.04)
North America > Canada > Ontario > Toronto (0.04)
(7 more...)

Genre:

Personal > Honors (1.00)
Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Nayak, Anmol, Timmapathini, Hari Prasad

LLM2KB: Constructing Knowledge Bases using instruction tuned context aware Large Language Models

arXiv.org Artificial Intelligence Aug-25-2023

The advent of Large Language Models (LLMs) has revolutionized the field of natural language processing, enabling significant progress in various applications. One key area of interest is the construction of Knowledge Bases (KBs) using these powerful models. Knowledge bases serve as repositories of structured information, facilitating information retrieval and inference tasks. Our paper proposes LLM2KB, a system for constructing knowledge bases using large language models, with a focus on the Llama 2 architecture and the Wikipedia dataset. We perform parameter-efficient instruction tuning for Llama-2-13b-chat and StableBeluga-13B by training small injection models that have only 0.05% of the parameters of the base models using the Low-Rank Adaptation (LoRA) technique. These injection models have been trained with prompts that are engineered to utilize Wikipedia page contexts of subject entities, fetched using a Dense Passage Retrieval (DPR) algorithm, to answer relevant object entities for a given subject entity and relation. Our best-performing model achieved an average F1 score of 0.6185 across 21 relations in the LM-KBC challenge held at the ISWC 2023 conference.

instruction, language model, relation, (14 more...)

2308.13207

Country:

Asia > Thailand (0.05)
Asia > Laos (0.05)
Asia > China (0.05)
(4 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial Intelligence Jun-9-2023

ChatGPT: Jack of all trades, master of none

Kocoń, Jan, Cichecki, Igor, Kaszyca, Oliwier, Kochanek, Mateusz, Szydło, Dominika, Baran, Joanna, Bielaniewicz, Julita, Gruza, Marcin, Janz, Arkadiusz, Kanclerz, Kamil, Kocoń, Anna, Koptyra, Bartłomiej, Mieleszczenko-Kowszewicz, Wiktoria, Miłkowski, Piotr, Oleksy, Marcin, Piasecki, Maciej, Radliński, Łukasz, Wojtasik, Konrad, Woźniak, Stanisław, Kazienko, Przemysław

OpenAI has released the Chat Generative Pre-trained Transformer (ChatGPT) and revolutionized the approach in artificial intelligence to human-model interaction. Several publications on ChatGPT evaluation test its effectiveness on well-known natural language processing (NLP) tasks. However, the existing studies are mostly non-automated and tested on a very limited scale. In this work, we examined ChatGPT's capabilities on 25 diverse analytical NLP tasks, most of them subjective even to humans, such as sentiment analysis, emotion recognition, offensiveness, and stance detection. In contrast, the other tasks require more objective reasoning like word sense disambiguation, linguistic acceptability, and question answering. We also evaluated the GPT-4 model on five selected subsets of NLP tasks. We automated the ChatGPT and GPT-4 prompting process and analyzed more than 49k responses. Our comparison of its results with available State-of-the-Art (SOTA) solutions showed that the average loss in quality of the ChatGPT model was about 25% for zero-shot and few-shot evaluation. For the GPT-4 model, the loss for semantic tasks is significantly lower than for ChatGPT. We showed that the more difficult the task (lower SOTA performance), the higher the ChatGPT loss. This applies especially to pragmatic NLP problems like emotion recognition. We also tested the ability to personalize ChatGPT responses for selected subjective tasks via Random Contextual Few-Shot Personalization, and we obtained significantly better user-based predictions. Additional qualitative analysis revealed a ChatGPT bias, most likely due to the rules imposed on human trainers by OpenAI. Our results provide the basis for a fundamental discussion of whether the high quality of recent predictive NLP models can indicate a tool's usefulness to society and how the learning and validation procedures for such systems should be established.

chatgpt, chatgpt answer, emotion, (16 more...)

doi: 10.1016/j.inffus.2023.101861

2302.10724

Country:

Asia > Russia (0.45)
North America > Cuba (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
(29 more...)

Genre:

Research Report > New Finding (1.00)
Personal (1.00)

Industry:

Banking & Finance (0.92)
Law > Civil Rights & Constitutional Law (0.67)
Media (0.67)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.45)

arXiv.org Artificial Intelligence May-30-2023

GPT4GEO: How a Language Model Sees the World's Geography

Roberts, Jonathan, Lüddecke, Timo, Das, Sowmen, Han, Kai, Albanie, Samuel

Large language models (LLMs) have shown remarkable capabilities across a broad range of tasks involving question answering and the generation of coherent text and code. Comprehensively understanding the strengths and weaknesses of LLMs is beneficial for safety, downstream applications and improving performance. In this work, we investigate the degree to which GPT-4 has acquired factual geographic knowledge and is capable of using this knowledge for interpretative reasoning, which is especially important for applications that involve geographic data, such as geospatial analysis, supply chain management, and disaster response. To this end, we design and conduct a series of diverse experiments, starting from factual tasks such as location, distance and elevation estimation to more complex questions such as generating country outlines and travel networks, route finding under constraints and supply chain analysis. We provide a broad characterisation of what GPT-4 (without plugins or Internet access) knows about the world, highlighting both potentially surprising capabilities and limitations.

large language model, machine learning, natural language, (18 more...)

2306.0002

Country:

Europe > United Kingdom > England > Greater London > London (0.28)
Europe > Austria > Vienna (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
(79 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Infrastructure & Services > Airport (1.00)
Transportation > Ground > Road (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligence Oct-26-2022, 09:54:31 GMT

Python NumPy Tutorial - 2022

So you've learned the basics of Python and you're looking for a more powerful way to analyse data? NumPy is what you need. NumPy is a module for Python that allows you to work with multidimensional arrays and matrices. In addition, NumPy includes support for signal processing and linear algebra operations. So if you need to do any mathematical operations on your data, NumPy is probably the library for you. In this tutorial, we'll show you how to use NumPy to its full potential. You'll learn more about arrays as well as operate on them using mathematical functions. NumPy, which stands for Numerical Python, is a library consisting of multidimensional array objects and a collection of routines for processing those arrays. Using NumPy, mathematical and logical operations on arrays can be performed. In this Python NumPy Tutorial, we will be learning about NumPy in Python, What is NumPy in Python, Data Types in NumPy, and more. NumPy in Python is a library that is used to work with arrays and was created in 2005 by Travis Oliphant.
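A minimal sketch of the array operations described above is given here. It is not taken from the tutorial; the variable names and sample values are assumptions chosen for illustration.

```python
import numpy as np

# A 2-D array (matrix) built from nested Python lists.
matrix = np.array([[1, 2, 3], [4, 5, 6]])
print(matrix.shape)  # (rows, columns)

# Mathematical operations apply element by element.
doubled = matrix * 2
column_sums = matrix.sum(axis=0)

# Logical operations return boolean arrays of the same shape.
is_even = matrix % 2 == 0

# A basic linear algebra routine: matrix product with the transpose.
gram = matrix @ matrix.T
print(doubled, column_sums, is_even, gram, sep="\n")
```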

ndarray, numpy, operation, (15 more...)

Technology:

Information Technology > Software > Programming Languages (0.96)
Information Technology > Data Science (0.69)
Information Technology > Artificial Intelligence (0.67)

#artificialintelligence Feb-23-2022, 07:40:15 GMT

The Complete Data Structures and Algorithms Course in Python

Why are Data Structures and Algorithms important?

algorithm, binary tree, linked list, (8 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Setting > Online (0.83)
Education > Educational Technology > Educational Software > Computer Based Training (0.40)

Technology:

Information Technology > Software > Programming Languages (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.60)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.40)

#artificialintelligence Jun-28-2021, 13:10:44 GMT

Python lists, Numpy arrays and Pandas series

Let's say you have the odd numbers between 1 and 20 and you are storing them in three ways: as a list, as a NumPy array and as a Pandas series. Lists, arrays and Pandas series look quite similar at first glance, so people often ask: why do we need different data structures? What are the pros and cons and use cases? The purpose of this brief article is to clear up some of that confusion. Lists are one of the 4 built-in data types in Python to store multiple items (the other three being dictionaries, tuples and sets). A single list can store multiple data types at once: integers, floats, strings.
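The article's original snippet is not included in this excerpt, so the following is a minimal sketch of how the odd numbers could be stored in each structure. The variable names are assumptions, and the code is not the author's.

```python
import numpy as np
import pandas as pd

# The odd numbers between 1 and 20, stored three ways.
odds_list = list(range(1, 20, 2))
odds_array = np.array(odds_list)
odds_series = pd.Series(odds_list)

# A list can mix data types; a NumPy array holds a single dtype.
mixed = [1, 2.5, "three"]

# Arithmetic is element-wise on arrays and Series,
# while "+" on lists concatenates instead.
print(odds_array + 1)
print(odds_series + 1)
print(odds_list + [21])
```

The difference in how `+` behaves is one practical reason to pick an array or a Series over a list for numerical work.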

data type, numpy array, numpy array and panda series, (5 more...)

Technology:

Information Technology > Software > Programming Languages (0.39)
Information Technology > Data Science (0.39)
Information Technology > Artificial Intelligence (0.36)

#artificialintelligence Jun-26-2021, 17:30:29 GMT

TextGenie - Augmenting your text dataset with just 2 lines of code!

Often, while developing Natural Language Processing models, we find it difficult to find relevant data. Previously, while developing our Intent Classifier, we used the CLINC150 Dataset that had 100 samples for 150 different classes. But what if we needed even more samples? A similar situation came up when I was working on a contextual assistant with Rasa. While creating the training data from scratch, I'd have to imagine different samples for each intent or ask my friends for some help.

augmented data, dataset, textgenie, (14 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.91)